A test for the rank of the volatility process: the random 

perturbation approach 

Jean Jacod * Mark Podolskij t 

<N 

' December 24, 2012 

D ' 
Q 

Abstract 

, In this paper we present a test for the maximal rank of the matrix- valued volatility 

Xj} ' process in the continuous Ito semimartingale framework. Our idea is based upon a 

random perturbation of the original high frequency observations of an Ito semimartin- 
gale, which opens the way for rank testing. We develop the complete limit theory for 
the test statistic and apply it to various null and alternative hypotheses. Finally, we 
demonstrate a homoscedasticity test for the rank process. 
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^ ; 1 Introduction 

(N , 

In the last years asymptotic statistics for high frequency observations has received a lot of 
attention in the literature. This interest was mainly motivated by financial applications, 
^ \ where observations of stocks or currencies are available at very high frequencies. As under 

the no-arbitrage condition prices processes must be semimartingales (see e.g. [1]), a lot of 
research has been devoted to statistics of high frequency data of semimartingales. We refer 
to a recent book [10] for a comprehensive study of infill asymptotic for semimartingales. 

This paper is devoted to testing for the maximal rank of the matrix- valued volatility 
process in the continuous Ito semimartingale framework, and more specifically for a d- 
dimensional continuous Ito semimartingale X which is observed at equidistant times over 
a fixed time interval [0, T]: we observe (AjA„)o<i<[T/A n ] ; an d the high-frequency approach 
consists in assuming A n — > 0. 
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A continuous Ito semimartingale can be written as 

dX t = b t dt + a t dW t , (1.1) 

where W is a Brownian motion, and there are many representations of this form, with 
different Brownian motions W and, accordingly, different volatility processes a. What 
is "intrinsic" is the drift coefficient bt and the diffusion coefficient ("squared volatility") 
Cf = atCti m the sense that they are uniquely determined by X, up to a Lebesgue-null set 
of times (throughout the paper a* denotes the transpose of the matrix a). 

For modeling purposes and economical interpretation we would like to find, and often 
choose, the smallest possible dimension of the Brownian motion W in the representation 
(|l.ip . Assuming further that 1 1->- ct is continuous, this smallest possible dimension is the 
supremum in time of the rank of the R rfxrf -valued process c over the time interval [0,T]. 
We are further interested in homoscedasticity testing for the rank process. 

A partial answer to this question was given in [9]. The authors of this paper studied the 
problem of testing the null hypothesis sup tg r T \ rank(ct) > vq against sup tg [ T ) rank(c t ) < 
7"o for a given number tq. However, their method does not extend to testing null hypotheses 
of other types, e.g. sup 46 [ 0j r) rank(cj) = tq against sup tg j 0j T) rank(c 4 ) ^ ro (which is much 
more useful). In the classical setting of i.i.d or weakly dependent data various estimation 
methods for the rank of an unknown covariance matrix (and related objects) have been 
proposed. We would like to mention Gaussian elimination method with complete pivoting 
of [3] and the test suggested in [13] among others. Unfortunately, these procedures can 
not be applied to our statistical problem as the probabilistic structure of the process X is 
more complex and the rank is time- varying. 

Our method is based upon a random perturbation of the original data and determinant 
expansions. The main idea can be described as follows: if we compute det (ct + het) for 
a positive definite d x d matrix e t independent of q and h X 0, then, under appropriate 
conditions, its rate of decay to depends on the unknown rank of c±. Hence, the ratio 
det (ct + 2het)/ det (ct + het) asymptotically identifies the rank of cj. Indeed, our main 
statistic is a partial sum of squared determinants of matrices build from d consecutive 
increments of the process X and the random perturbation is performed by a properly 
scaled Brownian motion W' , which is independent of all ingredients of X. We remark that 
perturbation methods (and matrix expansions as well) find applications in various fields 
of mathematics; we refer for instance to [TT] whose authors apply matrix perturbation 
methods to determine the number of components in a linear mixture model from high 
dimensional noisy samples. Furthermore, the methods of [2] also rely upon a generation 
of a new Brownian motion W 1 although in a completely different setting. 

The paper is structured as follows. Section [2] is devoted to model assumptions, testing 
hypotheses and test statistics. We present the asymptotic theory for our estimators and 
apply it to maximal rank testing in section [3l In section H] we develop a test for the null 
hypothesis of constant rank. All proofs are deferred to section [5j 
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2 Model, assumptions and a random perturbation 

2.1 The setting and testing hypotheses 

Our process of interest is a d-dimensional continuous Ito semimartingale X, given on 
some filtered probability space (Q, F, (J^)t>o,P). In vector form, and with W denoting a 
(/-dimensional Brownian motion, it can be written as 

X t = X + [ b s ds+ [ a s dW s , (2.1) 
jo Jo 

where bt is a (i-dimensional drift process and at is a R dx,? -valued volatility process, assumed 
to be continuous in time (and indeed much more, see Assumption (H) below). We set 

c t = o t a* t , r 4 = rank(c t ), R t = sup r s . (2.2) 

se[o,t) 



We remark that the maximal rank Rt is not bigger than the rank of the integrated 
volatility Cfdt, but may be strictly smaller. As already mentioned, it is suitable to use 
the smallest possible dimension for W, on the time interval [0, T\. This is the P-essential 
supremum of u i— > Rt(oj), but, since a single path t t- > X t (uj) is (partially) observed, 
the only available information is Rt itself. So the problem really boils down to finding 
the behavior of the process rj, and for this the choice of the dimension of W in (12. lj) is 
irrelevant. 

The rank rt is the biggest integer r < d such that the sum of the determinants of 
the matrices (cf)ij & j, where J runs through all subsets of {I,--- , d} with r points, is 
positive (with the convention that a x matrix has determinant 1); see e.g. [9J Lemma 
3]. Since q is continuous, this implies that for any r the random set {t : rt(u>) > r} is 
open in [0, T), so the mapping £ i — > is lower semi-continuous. In particular, the set 
{t € [0,T) : rt{uj) = Rt(uj)} is a non-empty open subset. These properties also yield that 
the process r t is predictable and that the following subsets of f2, which later will be the 
"testing hypotheses" , are Jr-measurable: 

{uj : Rt{oj) = r} 

{u : r t {uj) = R t {oj) for all t G [0, T}} 

{oj : 1 1— > rt(co) has finitely many discontinuities and is 

not Lebesgue-a.s. constant on [0,T]}. 

Notice that we impose that = Rt m ^-t > whereas the lower semi-continuity only implies 
in general that < Rt- Observe also that a priori t\-¥Tt may be Lebesgue-a.s. constant 
and still have discontinuities (even infinitely many) on [0, T\. So, and Vtj, are disjoint 
but 0,^ U in general. The main aim of this paper is testing the null hypothesis Q T 

against Sl^ r = U r '^ rj o< r '<d^T ( an d related hypotheses) and testing the null hypothesis of 
fif. against Oji. 
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2.2 Matrix perturbation 

In order to explain the main idea of our method, we need to introduce some notation. 
Recall that d and q are the dimensions of X and W, respectively. Then Ai is the set of 
all d x d matrices, Ai r for r G {0, • • • , d} is the set of all matrices in Ai with rank r, and 
Ai' is the set of all d x q matrices. For any matrix A we denote by Ai the ith column of 
A] for any vectors x\,...,x^ in M d , we write mat(xi, • • • , x^) for the matrix in Ai whose 
ith column is the column vector Xj. For r G {0, • • • ,d} and A, B G Ai we define 

■A^B = {C € .M : Gi = Ai or G % = B t with #{i : G; = AJ = r}. (2.4) 

In other words, Ai r A B is the set of all matrices G G Ai with r columns equal to those of 
A (at the same places), and the remaining d — r ones equal to those of B. Let us define 

j r (A,B)= det(G). (2.5) 



We demonstrate the main ideas for a deterministic problem first. Let A G Ai be an 
unknown matrix with rank r. Assume that, although A is unknown, we have a way of 
computing det(A + hB) for all h > and some given matrix B G Aid- The multi-linearity 
property of the determinant implies the following asymptotic expansion 

det{A + hB) = h d - r lr {A, B) + 0{h d ~ r+1 ), (2.6) 

which is the core of our method. Thus, if j r (A, B) ^ 0, we have 

det{A + 2hB) ,_ r 



det(A + hB) 



as h I 0. (2.7) 



and this convergence identifies the parameter r. However, it is impossible to choose a 
matrix i? G Ai which guarantees r y r {A,B) ^ for all A G Ai r . To solve this problem 
we can use a random perturbation. As we will show later, for any A G Ai r we have 
7 r ( J 4,£?) 7^ a.s. when i? is the random matrix whose entries are independent standard 
normal. This idea will be the core of our testing procedure. 



2.3 Assumptions and the test statistic 

Before we proceed with the definition of the test statistic, we introduce the main assump- 
tions. We need more structure than the mere Equation (|2.ip . namely that the processes 
bt and at, and also the volatility of <r t , are continuous ltd semimartingales. In view of the 
previous discussion, it is no restriction to assume that all these are driven by the same 
g-dimensional Brownian motion, provided we take q large enough. This leads us to put 



Assumption (H): The <i-dimensional semimartingale X, defined on (CI, J 7 , {J-t)t>o, P) ; 
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has the form 

X t = X + J* b s ds + / * a s dW s 

°t = °o + Jo a * ds + Jo* u * r 2 g ) 

^ = &0 + Jo a 's ds + Jo V 's dW s 

v t = VQ + J * a" (is + J * i# e?W s , 

where W is a ^-dimensional Brownian motion, and bt and a' t are M d - valued, <rt, at an d 
t are M dXlJ -valued, v t and a" are ]R dXlJX ' ? -valued, and v" is ]R dx<JX,?x ' 3 -valued, all those 



v' 



processes being adapted. Finally, the processes at,v' t ,v' t ' are cadlag and the processes 
a! t ,a'l are locally bounded. □ 



At this stage it is not quite clear why the full force of assumption (H) is required. In 
the standard limit theory for high frequency data of continuous ltd semimartingales, see 
e.g. [UEj, only the first two representations of (|2.8p are assumed. We will further explain 
condition (H) once we introduce the test statistic. When bt = gt(X t ), at = gi (Xt) with 
gi € C 2 (M. d ) and 52 £ C 4 (M d ), then (H) is automatically satisfied, due to Ito's formula. 

Remark 2.1 Since at is not uniquely specified, whereas q is, and since we really are 
interested in specific properties of q, it would be much nicer to replace the structural 
assumption on at (second equation in (|2.8|) ) by a similar assumption on the process Cj 
itself. 

This is of course a trivial matter when ct is everywhere invertible: in this case q is 
a continuous Ito semimartingale if and only if at is. But here we are precisely trying to 
describe the rank of the matrix q, so it is out of the question to assume that it is a priori 
invertible. Unfortunately, we were unable to replace the assumption on a by a similar 
(and de facto weaker) assumption on c. □ 

Motivated by the matrix perturbation at (|2.6p . our tests will be based on statistics 
involving sums of (squared) determinants. The test function will be the nonnegative map 
/ on (R d ) d defined as 

/(xi, ...,x d ) = det(mat(xi, • • • ,x d )) 2 . (2.9) 

The authors of [9] used the following statistics 
[t/A n ]~d+l 

A n Yl f{^X/^--- ,^ d _ 1 X/ y /\), AfX = X iAn -X (i _ 1)An , (2.10) 

i=l 

to test for the full rank, thus allowing for efficient testing of the null hypothesis 0^. On 
the sets £l r T with r < d, however, it exhibits complex degeneracies and becomes difficult to 
study. In order to be able to analyze the asymptotic behavior of the preceding statistics, 
we introduce a random perturbation of the original process X as motivated at the end of 
Subsection 12.21 (a somewhat similar idea in a different context was applied in [2]). More 
specifically, we choose a non-random invertible dx d matrix a and generate a new process 



X[ = aWi 



A test for the rank of the volatility process 



6 



where W' is a d-dimensional Brownian motion independent of all processes in (12. 8p (without 
loss of generality, for the mathematical treatment below we may assume that it is also 
defined on the space ($7, T, (Ft)t>o, P))- Following the ideas of section [221 we add to 
X this new process X' , with a multiplicative factor going to 0. As a matter of fact we 
introduce two such additions, and for k = 1 or 2 we set 



Z t n ' K = X t + y^KnX' t . (2.11) 

Hence, with the notation of section [272], we use h = V A n , which leads later to the optimal 
rate of convergence. 

Another problem arises, namely in (12.10p successive summands partly use the same 
increments of X, and this causes problems for the Central Limit Theorem. These problems 
can actually be overcome, at the expense of quite many additional technicalities, and with 
the advantage of a smaller asymptotic variance for our estimators below. However, in our 
case the crucial point is the choice of the tuning "parameter" a: this choice has an impact 
on the asymptotic variance as well, and since an "optimal" choice of a seems out of reach, 
we will content ourselves with an arbitrary choice of a and with a version of (|2.10p with 
no overlapping of increments between the successive summands. This leads us to use the 
following two basic statistics: 

[t/2dA n ]-i / _ 7-".! _ 7 n >i \ 

cn,l _ 2rfA , / Z, (2id+l)A TC Zj 2idA n Zj {2id+d)A n Zj (2id+d-l)A n \ ^ ^ 

[t/2dA n ]~ 1 / yn,\ _ yn,\ jri,\ _ yn,l \ 

-,n,2 _ o , A A Zj (2id+2)A n Zj (2id)A n Zj (2id+2d)A n Zj (2id+2d-2)A n \ 



S^ 2 = 2dA n £ / 



i=0 



Notice that the statistics S 1 ™' 1 and S"™' 2 are essentially the same, except 5™' 2 is computed 
using the frequency 2A n . At stage n one observes the increments A" A and simulates the 
increments AfX' for i < [t/A n ], so one "observes" all variables incurring in the definition 
of these two statistics. 



Remark 2.2 Now, let us explain why the assumption (H) and the random perturba- 
tion in (|2.1ip are required. A direct stochastic expansion of the increments AfZ 71 under 
assumption (H) implies the decomposition 

mat(A^"/ v / A;, . . . , A^Z"/^) = a? + sfA~ n {^{!) + #(2)) + O p (A„), (2.13) 

where the matrices af = mat (a" 1 , . . . , af d ), (3?(k) = mat(/3" 1 (fc), . . . , P™ d {k)), k = 1,2, in 
M are given by 

a& = A; 1/2 f T (j _ 1)Ari Af +J _ 1 ^, 

= VDA„ + A" 1 w (t _ 1)Af , /(^f)^ " W (i+J . 1)An )dW s (2.14) 

/3^.(2) = A- 1/2 5Af +J _ 1 ^'. 

We remark that the matrices a™, /3f(l), /3™(2) are Op(l). In the case rt < d— 1 for all t, the 
first order term af, which depends on the process at, gives a degenerate limit when plugged 
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in into the statistics (|2TLj) or (l2~T2l . Hence the second order term y/~K^(P?{\) + /3f(2)), 
which involves the processes bt and vt, becomes important. Indeed, we will see in section 
[3] that it affects the limits. Furthermore, it is important to control the error of the above 
decomposition, and this is done by using the last two equations in (|2.8|) . 

The asymptotic expansion in (|2. 13|) is a stochastic analogue of the perturbation pre- 
sented in (12. 6p (up to an error term) with A = a™, B = /3™(1) + (3™ (2) and h = y^n- 
Under assumption (H) the term /3™(1) already constitutes a random perturbation of the 
leading matrix a™. However, this perturbation does not guarantee that the quantity 
7 r (a™, /3"(1)) defined in (|2.5[) does not vanish when rank(<r ( -j_ 1 )^ n ) = r (which is essential 
for our method). To illustrate this problem let us give a simple example. Let d = 3, q = 1 
and define the processes 

dx{ = 4dW t , do{ = v{dW t , j = 1, 2, 3. 

(so W is a one-dimensional Brownian motion.) Then rank(a") = 1, rank(/3™(l)) = 1, and 
hence 7i(a™, f3f (1)) = 0. The presence of the new independent process X' , and thus of 
the term /3f (2), regularizes the problem. Indeed, we will show that 7r(af (1) + /3J 1 (2) ) 
does not vanish whenever rank(<j(j„i)A„) = r. Finally, the perturbation rate h = \J A n in 
front of the process X' is chosen to achieve the best rate of convergence for the normalized 
versions of the statistics S 1 ™' 1 , S'™' 2 . □ 

Following the expansion (12.61) we know that that the order of det(a™ + V A n (/3™(1) + 
(3f(2))) 2 is increasing in r = rank(ci(j_ 1 )A n )- Consequently, as in (|2.7p . the ratio S^/S^ 1 
is expected to identify (asymptotically) the maximal rank Rx- The complete asymptotic 
theory is presented in the next section. 



3 The asymptotic results and test for the maximal rank 
3.1 Notation 

In order to present the main asymptotic results we need to introduce a few more notation. 
We define the function F r on (M 2 ^ by 

F r (vi, . . . ,v d ) = 7 r (mat(xi, • • • , x d ), mat(yi, • • • ,y d )) 2 if vj = ( X J J G M. 2d . (3.1) 

Next, let U = M' x M x R dq2 x R d , whose points are u = (a, (3, 7, a), where a G and 
(3 £ Ai and 7 G R dq and a G Let us denote by W and W two independent Brownian 
motions with respective dimensions q and d, defined on some space (O, J 7 , (J 7 ^), P). If 
u£lA and k = 1, 2 and i > 1 we associate the 2d-dimensional variables with the following 
components for I = 1, • • • , d: 

«)| = ^ E^=i « /rr W* - WT (i _ 1} ) 

*(u, K )f + ' = a' + ^ El=i /? /m (W- - TT'-_ 1} ) + I £^ =1 7 ,mfc W k s dW?. 

(3.2) 
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With the notation (13. ip we can then define the variables 

F r (u,K) =F r (*(tt,«) 1} ... ,tf(u, «)«*)■ (3.3) 

The two sequences (^(u, k))«>i are not independent, but they have the same (global) law, 
for k = 1,2. Therefore if u = (a, /3, 7, a) we can set 

T r (u) = E(F r (u, 1)) = E(F r (u, 2)) 

r;(u) = I(F r (u, 1) 2 ) - r r (u) 2 = I(F r (u, 2) 2 ) - r r (u) 2 (3.4) 

r;'(u) = I(F r (u, 1) F r (u, 2)) - T r (n) 2 . 
We then obtain the following crucial properties 

Lemma 3.1 Let u = (a, f3, 7,0) G U with j3 £ Md- Then if r & {0, 1, • • • , <i}, 

rank(a) = r =^ r r (u) > 0, T' r (u) > T"(u) (3.5) 
rank(a) < r => r r (u) = r^(u) = T^(u) = 0. 

3.2 The limiting results 

The key result is the asymptotic behavior of the processes S nj as n — > 00. These processes 
enjoy a Law of Large Numbers and a Central Limit Theorem, the centering being around 
one of the following processes, where r is any (fixed) integer between and r: 



S(r) t = / T r (a s ,a,v s ,b s )ds . 
Jo 



(3.6) 



We will in fact have a CLT for the two-dimensional processes U(r) n with components 

U( r ) n >" = -L= ( - - \ , S n ' K - S(r)) . (3.7) 

Of course, the centering process S(r) depends on r, so one needs an additional assumption 
related with the particular value of r which is chosen below (in contrast, the centering 
term is the same for all components): 

Theorem 3.2 Assume (H), and also that rt(uj) < r identically for some r € {0, • • • , d}. 
Then we have the stable (functional) convergence in law 

U(r) n U(r), (3.8) 

whereU(r) = {U(r) K ) K= i^ is defined on an extension (CI, J 7 , (J 7 t)t>o,^') of (CI, J 7 , (J r t)t>o,P) 
and is, conditionally on T , a continuous centered Gaussian martingale with conditional 
covariance 

{2df^T'(a s ,a,v s ,b s )ds if k = k' 
2d J Q r"(a s ,a, v s , b s ) ds if k 7^ K. 
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Note that in the above setting, if r < r' < d, we also have rt < r' and thus the results 
also hold with r' instead of r everywhere. This does not bring a contradiction because, by 
(|3.5h . in this case the processes S(r') and U(r') are identically vanishing. 

Now, these processes S n ' J are only tools, and at the end we will be interested, for any 
T > fixed, in "estimators" for Rt, which are 

^Tl^- '^ff'' (3.10) 
log 2 



The quantity is a transformed analogue of the term on the left side of (]2.7p . The following 
corollary is then a simple consequence of the previous theorem: 

Corollary 3.3 Assume (H), and let r £ {0, • • • ,d} and T > 0. Then the following stable 
convergence in law holds: 

(R(n,T)-r) ^4 S(T) on the set Q r T , (3.11) 



where S(T) can be realized as S(T) = (^(Ot ~~ U(r) T ) / S{r)T and is thus defined on 

an extension (fi, J 7 , (J r t)t>o, IP) of (fi, J-, (J~t)t>o^) an d i> s > conditionally on J 7 , a centered 
Gaussian variable whose conditional variance is 

E((S(T)) 2 | F) = V(T), 

where V(T) is a.s. positive and given by 



1 Vir) 1 / +V(r) 2 / - 2V(r) 



1,2 

r 



V(T) = - -2 t~~ / j on each set Vt r T . (3.12) 



(log 2)2 (S(r] 



T 



In order to make this result feasible, we need consistent estimators for V{T). For the 
denominator S{r) T we can of course take the square of A! n ~ d S^' 1 . As for the numerator, 
we need estimators for V{r) T ,K . Up to normalization, natural ones are as follows: 

[t/2dA n }-l / yn,K _ yn,n 

yn,KK' _ 4^2^ f j 3. 2id + K ) A « 2idA r , 

i=0 V 



V nA n 



Xf 



yn,K yTl,K 

Zj {2id+n')A n ~ A 2idA r , 



vVA n 



Proposition 3.4 Assume (H). 



Zj {2id+Kd)A n ~ Zj (2id^ r 


■re(d-l))A„ \ 


\/K,A n 




Zj (2id+K'd)A n ~ Zj {2id+n l 






(d-l))A„^ 




(3.13) 


we have for k, k' = 1, 2 





_^_ V n^ m 2d J @r' K 'ds, where 

T' r (a s ,a,v s ,b s ) + T r (a s ,a,v s ,b s ) 2 if k = k' ( 3 - 14 ) 
r"(os,a,v s ,b s ) + T r (a s ,a,v s ,b s ) 2 if k ^ k' . 



e 



r,K,K 
s 
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b) We have 

V(n,T) :=^— 



y™' 11 -\- 2 2 ( R ( n ^ T )- d )v^' 22 — 2 1+R ( n ' T )- d V^' 1: 
(S^ 1 log 2)2 



Remark 3.5 The numerator of the right side of (|3.12p is also 2(V(r) T 1 — V{r) T 2 ). There- 
fore we have 



yn,U _ 2 l + R(n,T)-d v n,l2 



V'{n,T) = -Z ^_ t ^ V{T) onthesetn r 



as well. However, V(n, T) > by construction (and it is even a.s. positive unless r< = 
identically on [0,T]), a property not shared by V'{n,T). □ 

Now, by the delta-method for stable convergence in law, the two previous results 
immediately yield: 

Corollary 3.6 Under (H) and for any T > we have 

R(n, T) — Rt c 



y/A n V(n,T) 



$, (3.16) 



where <I> ~ A/"(0, 1) is defined on an extension (O, (Ft)t>o-, P) o/ (Jl,^, (^)t>o,P) anc 
is independent of J 7 . 



3.3 Tests for the maximal rank 

So far, it seems that R(n,T) are estimators for the maximal rank R?, which equals r 
on the set £l T , and even feasible estimators if we use Corollary 13.61 In particular, this 
corollary seems to allow us to easily construct confidence intervals for R?. 

However, what precedes does not make much statistical sense: the parameter Rt to 
be estimated takes its values in {0, 1, • • • , d}, whereas the estimators R(n, T) are of course 
not integer-valued and can even be negative, or bigger than d. One could overcome this 
problem by taking the integer closest to R(n,T), say R'(n,T), and then use R"(n,Y) = 
V (R'(n,T) A d) as the final estimator. Note that R"(n,T) enjoys the same CLT as 
R(n, T) does, on each U T with 1 < r < d — 1, but of course not when Rt = or Rt = d, 
in which cases the limiting law of the normalized error is "half Gaussian and half a Dirac 
mass at 0". Furthermore, confidence intervals have little meaning in this context, except 
perhaps when the dimension of X is very large. 

So, it seems more appropriate here to do testing: we can test the null hypothesis that 
the path lies in Vt T for some r, against the alternative that it is in £l T for another specific 
r' 7^ r, or for all r' > r or all r' < r, or all r' ^ r. We may also use composite null 
hypotheses, such as being in £1 T for some r smaller, or bigger, than a given value tq. 
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We start with the problem of testing the null hypothesis Oy, against the alternative 
£lj, T = ^r'^r,o<r'<d^T- ^or any a & (0,1), and with z a being the symmetric a-quantile 
of J\f(0, 1) defined by P(|$| > z a ) = a when $ ~ A/"(0, 1), we take the critical (rejection) 
region 

C(a)"' =r = {uj : \R(n,T)-r\ > z a ^ A n V(n,T)} . (3.17) 



Proposition 3.7 Under (H), the tests (3.17) have the asymptotic level a for testing the 
null fiy, in the sense that 



A C n r T , P(A) > 



°(C(a)"' =r \A)->a 



(3.18) 



(above, P(. | A) is the usual conditional probability). They are also consistent for the 
alternative , in the sense that 



p(c(a)^ =r nnp) -+p(^ r ). 



(3.19) 



One constructs one-sided tests in the same way. For example, if we want to test the 
null hypothesis f^ r = U r '< r f2y against the alternative Q^, r = U r i >r ^l'! T , and if z' a is the 
one-sided a-quantile defined by P(<I> > z' a ) = a, we take the critical region 

C(a)^ r = {oj: R(n, T) > r + z' a y/A n V(n,T)} . (3.20) 
Exactly as above, one obtains the following proposition. 



Proposition 3.8 Under (H), the tests Ii3.20\) have the asymptotic level at most a for 
testing the null Qf, r , and indeed satisfy 

A c QJf r , P(A) > P(C(a)™' =r | A) -> oF(^ \ A) < a, (3.21) 



and are consistent for the alternative f2 



>r 
T ■ 



The tests for the null Vtj, r against QS r are obtained analogously. 



Remark 3.9 Let us link our testing procedure with some other statistical problems: 

a) In [5j [6] parametric estimation methods for the so called integrated diffusions have 
been developed. An integrated diffusion is a process that satisfies the first and the third 
equations of assumption (H) with a = 0, i.e. 

dX t = bfdt, 

where bt is a continuous ltd semimartingale. We refer to [5] for various applications of 
these models in natural sciences. Given high frequency observations of X, testing the 
null hypothesis of integrated diffusion versus the alternative of a diffusion with a present 
volatility part a is equivalent to testing f2j, versus 
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b) Another potential application of our method is a test for "perfect correlation" between 
the process X and the unobserved volatility u. The problem can be formulated as follows: 
Let X and a be two one-dimensional continuous Ito semimartingales of the form 

dX t = b t dt + atdWt, da 2 = atdt + v t dB t , 

where W and B are one-dimensional Brownian motions with the bracket process [W, B]t = 
pt, \p\ < 1- For financial applications testing the hypothesis \p\ = 1 versus \p\ < 1 is of 
certain interest. Note that \p\ = 1 appears in the SDE case, i.e. when at = g(Xt) with 
g G C 2 (M). We refer to testing local volatility hypothesis in [12] for a more detailed 
discussion (see also [H] for related statistical problems). The aforementioned problem 
is equivalent to testing Q)p versus fi^ 1 = Q, 2 -, for the two-dimensional process (X, a 2 ). 
Since the process a 2 is unobserved, it has to be locally estimated from the high frequency 
observations of X first (see e.g. [12] for more details). □ 



4 A test for a constant rank 

This section is devoted to a seemingly different topic, namely whether the a priori time- 
dependent rank is constant or not. Our test statistics will be based on a distance measure 
between the rank process rt and the maximal rank Rt, which vanishes if and only if the 
rank is constant almost surely. For the formal testing procedure we will need some limiting 
results for the "spot estimators" of the rank. By this, we mean estimators for rt, for any 
given t, at least under the assumption that r s is equal to rt for all s in some right or left 
neighborhood of t. 

To describe these spot estimators we pick a sequence k n > 1 of integers going to infinity, 
and such that k n A n — ?• (as for spot volatility estimators), and precise specifications for 
k n will be given later, although we always assume k n > Ad. For any integer i > 1 we set 

nn > lQ g S i an ^2d(i+l)fc„ A n ~ ^2dik n A n , . 

^ =d "loi2"' Si= ~^ T^M • ( 422 ) 

6 °2d(i+l)k n A n °2dik n A n 



Then Rf, more or less, plays the role of an estimator of the maximum of rt over an interval 
of length 2dk n A n around the time 2idA n , and we set for any p > 0: 

= 2dk n A n ztll dknAn] ~ 2 m kn \ p a (d+in ^ 

B(n,p,T) = A{p)^ - a(n,T)(R(n,T)) p , a(n,T) = 2dk n A n ([T/2dk n A n ] - l). 
The asymptotic results for the quantity B(n,p,T) are as follows. 

Theorem 4.1 Assume (H), and let T > 0, p > and k n be such that k n An — > oo and 
kn,A n y 0. 

a) If t i-)- r t {<jj) is continuous except at finitely many points on [0, T], hence piecewise 
constant, we have 

B(n,p,T) A I {r s fds-T{R T f. (4.24) 
J o 
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b) We have the stable convergence in law: 
1 

B(n,p,T) ^4 B(p,T) in restriction to the set fi^ n {R T > 1}, (4.25) 



VS.. 

where B(p,T) is defined on an extension (f], J 7 , (J^)^, P) of (J), J 7 , (J r t)t>o, IP) is, 
conditionally on F, a centered Gaussian variable with conditional variance V(p, T) = 
K(B(p,T) 2 | J 7 ) given on each set Qj, by 

V lo g 2 / A) V r r(cr s , <r,v 3) b 8 ) S(r) T J 

(4.26) 

with V(r) KK ' being defined at (|3.9I) . 

Notice that the right side of (|4.24|) is on the set fiy, and strictly negative on Jlji. 



Remark 4.2 The reader will notice that in the definition of A(p)f the summands are 
\R?\ P A(d + l) d , instead of the more natural We could take this more natural form 

for (b) above, but it is useful (and innocuous from a practical viewpoint) to "bound" the 
summands, in order to obtain (a). We could bound them by dP instead of (d + l) p and 
still have (|4.24p . but then f|4.25j) would then fail in case r = d is the maximal rank: we 
would obtain a CLT with a non-Gaussian and non-centered limit. □ 



Remark 4.3 In the setting of (b| above, we will in fact prove a joint convergence for the 
variables A{p)j, — a(n,T)r p and R(n,T) — r, both normalized by l/y/A n (the second one 
being as in (|3.1ip ). and from which (|4.25p follows. Such a joint CLT even holds under the 
assumptions of (a), with a complicated limit, but this refinement is not useful for us in 
this paper. 



Remark 4.4 One can also prove a joint convergence for the variables A(p)j, — a(n, T)r p 
with different values of p, and still normalized by A n . However, when p > p' > it 
turn out that the difference -4- (A(p)% - a{n,T) l - p '/ p (A{p')^) p / p ') converges to 0, and 
no known normalization gives a proper CLT. □ 



As before, we need consistent estimators for the conditional variance V(p,T). Such 
estimators are constructed in a way analogous to (|3.13p . That is, we set with k n as above: 

[t/2dA n ]-fc n -l 

TTn,nn' _ . ,2 \l+2d-2R{n,T) ST^ ( jOKn^n 1 \ 

i=0 b 2d(i+k n )A n a 2idA n T 

yU,K yTl,K ryU,K 

^ j.f A (2id+K)A n ~ Zj 2idA„ Zj (2id+Kd)A n ~ Z/ (2id+K(d~l))A n \ (4 27) 

V \fnKn V K &n ' 



x/( 



fy7l,K yU,K fyTl,K fyU,K 

Zj (2id+K')A n ~ Zj 2idA„ Zj (2id+K'd)A n ~ (2id+K' (d-l))A„ 

\/K'A n 
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Theorem 4.5 Assume (H), and let T > 0, p > and k n be such that k n An — > oo and 
k n A n — > 0. Then we have 



V(n,p,T) : = ( ^tgT' 1 ) 2 (Yt 11 + 2 2 ^ T )- d W n / 2 - 2 ^+^T)-dyn,12^ 

V(p,T) on the set fl^. 

Moreover, the variables 



(4.28) 



Z(n, P , T) = B{U ^ T) (4.29) 



'A n (V(n,p,T)A(l/^)) 
have the following asymptotic behavior, where ~ A^(0, 1) is as in Corollary 

C~ s 

Z(n,p,T) — > $ in restriction to the set fi^ n {Rt > 1} 
Z(n,p,T) — oo in restriction to the set Qj, 



(4.30) 



Having all instruments at hand we proceed with testing. What is easily available is a 
family of tests for the null 0^ , whereas the alternative is restricted to $7^ . One does not 
know how to test the null Vtfp. 

For this purpose we use the statistic B(n,p,T). In fact, (|4.30j) gives us the behavior 
of this statistic on 0^ D {Rt > 1} 5 and this is the null which is tested below. Now, is 
the union of n {Rt > 1} and tt T , so if we are interested in testing the whole one 
can do a double test, using what precedes and Proposition 13.81 with r = 0. 

We propose to use the following critical region, where p > is chosen arbitrarily and 
z' a is again the one-sided a-quantile of jV(0, 1): 



C(a) T '= = {w : B(n,p,T) < -z' a ^J A n (V(n,p,T) A (l/VA n ))}. (4.31) 
Exactly as in the previous section we obtain the following result. 



Proposition 4.6 Under (H), the tests {^.31 ) have the asymptotic level a for testing the 
null n {Rt > 1}> in the sense of \3. 18\) . and are consistent for the alternative O^. 



5 Proofs 

Before we start presenting the formal proofs, let us give the road map. Subsection 15.11 
demonstrates some technical results on expansions of determinants. They are applied in 
Subsection 15.21 to prove Lemma 13.11 This Lemma implies that the process S(r)t defined 
at (|3,6p is strictly positive on the set £l T , which is crucial for our method. 

The first main result of our paper is Theorem 13. 21 whose proof is rather involved. First, 
we will show that the standard localization procedure (see e.g. Section 3 in [T]) implies 
that all processes in (H) may be assumed to be bounded without loss of generality. This 
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first step considerably simplifies the stochastic treatment of various quantities. A second 
crucial step is the stochastic expansion explained in Remark l2.2t we have ()2. 13[) and (|2,14p . 
Subsection 15.31 deals with the formal justification of this expansion, for which we will use 
slightly different notation. 

It turns out that the stochastic order of the error term related to the decomposition 
(|2.13p . namely Op(A n ), is not sufficient to show its asymptotic negligibility. However, we 
will prove that the error terms are martingale differences, so they will not affect the stable 
central limit theorem at (|3.8p . A similar treatment will be required for the error term 
connected with the stochastic version of the expansion (|2.6p . 

The proof of Proposition 13.41 (consistent estimation of the asymptotic conditional co- 
variance matrix) is somewhat easier. Corollary 13.31 follows essentially from Theorem 13.21 
by the delta method for stable convergence. The proofs of these results are collected in 
Subsection 15.41 In particular, we apply a stable central limit theorem for semimartingales 
(see e.g. Theorem IX. 7.28]) to prove Theorem 13.21 

The proof of Theorems 14.11 and 14.51 which is presented in Subsection 15.51 is a bit 
more involved than one of Theorem 13.21 although the main techniques are similar. The 
additional difficulty comes from the fact that we need to use the stable convergence of 
Theorem [321 but for processes evaluated at random times. Corollary 13.61 and Propositions 
13.71 13.81 and 14.61 are straightforward consequences of the previous results. 

5.1 Expansion of determinants. 

We first prove some general and easy facts about determinants. Below \\A\\ denotes the 
Euclidean norm of a matrix A G Ai. 

For m > 1 we call V m the set of all multi-integers p = (pi , • • • , p m ) with p\ + - ■ ■ +p m = 
d, and I p is the set of all partitions I = ■ ■ ■ ,I m ) of {1, • • • , d} such that Ij contains 
exactly pj points (so Ij = if pj = 0) . If p € V m and I G X p and A\ , • • • , A m G M. , 
we write G\ ... A for the matrix whose ith column is the ith column of Aj when i G Ij. 
Letting A, B, C G M, we can rewrite (|2.5p as 

lr {A,B)= det ( G A,s), (5-1) 

and we set 

j' r (A,B,C)= det ( G W)- ( 5 - 2 ) 

— 1,1) 

In the following two lemmas we present some technical results on determinant expansions. 

Lemma 5.1 For any m > 1 and A%, ■ ■ ■ , A m G M we have 

det(Ai + -.-+4*)= E E det ( G ^-AJ- M 
pev m leip 
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Proof. Letting Sd be the set of all permutations of {1, • • • , d} and sign(s) be the signature 
of s G S, we have 

d 

det(A + B) = ^(-i)signW JJ(a a W,i + 6»(0.<) 

= Yl Y^ sisn{s) U aS{l) ' l Il bS{i) ' l = Yl det ( G Ai c) )- 

IC{l,-,d} seS d iel i$I lc{l,-,d} 



This readily implies that if (| 5 . 3 j) holds for some m, it also holds for m + 1. Since (|5.3p is 
obvious for m = 1, the result follows by induction on m. □ 

Lemma 5.2 There is a constant K such that, for all r = 0, • • • ,d, all h G (0, 1] and 
A,B,C,D £ M with rank(A) < r we have, with A = ||A|| + \\B\\ + ||C|| + and with 
the convention 7_i(A, B) = 0: 



det(A + + /i 2 C + /i 2 B) - /i d " r 7 r (A, B) - /i d " r+1 (7r-i(A B) + ^(A, B, C)) 

< Kh r - d+l A d ~ l {hk+\\D\\), 



(5.4) 

det(A + hB + h 2 C + h 2 D) 2 - lr (A, B) 2 

-2h lr {A,B){ lr _ 1 (A,B)+ 1 ' r {A,B,C))\ < KhA 2d - 1 (hA+ \\D\\). { ' 



Proof. Let p G V 4 and I G X p . Then det^^^^) = ^+2^3+2^3 det{G\ B CD) ) 
vanishes when p\ > r, and has absolute value smaller than Kh P2Jr2p3Jr2pi A d ~ P4 \\D\\ P4 . 
Then (15.4p readily follows from (15.3p . and by taking squares in (15.4p we deduce (15.5p . □ 



With the same notation, and if further A',B',C',D' G M with rank(y4') < r also and 
A' = \\A'\\ + ||B'|| + ||C"|| + ll-B'H, and h! G (0, 1], the same argument shows that 



* , det(A + hB + h 2 C + h 2 D) 2 det(A' + h'B' + h l2 C + h l2 D') 2 
{hh ) (5.6) 

-7 r (A,B) 2 7 r (A',B') 2 | < K(h + h')(AA') 2d . 



5.2 Proof of Lemma 13.11 

1) The results about T r (u). We write Vi and Vi for the d-dimensional variables 
whose components are respectively the d first and the d last components of *&(u, l)j, 
for which we can take W = W and W' = W 7 ', and we set ^4 = mat(Vi,--- ,Vd) and 
B = mat(Fi, • • • If AjWW = - W^, we have 

= ^ a lm AiW m , V\ = a l +Y P lm \W !m + ^ j lkm h l)km (W), (5.7) 

m=l m=l m,fc=l 

where each hi t i m is a function of the path of W. Note also that F r (u, 1) = j r (A,B) 2 . 

Assuming first that the rank of a is (strictly) smaller than r, we observe that the rank 
of A is also smaller than r, implying by (15. ip that 7 r (^4, B) = 0, hence T r (u) = 0. 
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Next we assume that the rank of a is r, and proceed to prove T r (u) > 0. We first 
simplify the problem as follows. The matrix /3 is invertible and the rank of j3~ 1 aa* f3~ l >* 
is r, so we can write f3~ 1 a = IIA, where IT € M is an orthonormal matrix and A 6 Ai is 
a diagonal matrix whose diagonal entries Xj satisfy Xj 7^ if j < r and Xj = otherwise. 
Then, setting V- = IT /T 1 ^ and V'j = Wfi'Wj, the sequence (V^V'j) has the form 
(|5.7p . upon replacing W' by II* W' (another Brownian motion) and u = (a,/3,7,a) by 
y/ = (a' , J,j' , a'), where J is the identity in .M and a' = Ti*f3~ 1 a = A and j''^ 1 = 
^ =1 (ff 'p-ymjmji an d a ' = IF/T^. Furthermore, A' = mat(V{, ■ ■ ■ ,V£) = IT A 
and B' = mat(Fi, ■ ■ ■ ,V' d ) = ITp^B, implying j r (A', B') = det(n*/3^ 1 ) 7r (A, B), which 
in turn yields T r (u) = dct ^^ I"V(j/), because det(II) = 1. 

In other words, it is enough to prove the result when j3 = J and a = A is diagonal as 
above, and below we assume this. The two matrices A and B can thus be realized as 



A 



( Ai$l 



B 



( T\ + Ql 



(5.8) 



where all <3?* and T*- are i.i.d. J\f(0, 1) and the variables 0*- are independent of the T^'s 
(note that we have incorporated the constant a 1 in each variable ©*•). 

Let J T be the class of all subsets of {1, ■ ■ ■ ,d} with r points. Since A,- 7^ if j < r 
and Xj = otherwise, we see that, if I = (I,I C ) with / = {j\ < ■■■ < j r } £ J r and 
I c = {j[ < ■ ■ ■ < jrf„ r }, we have det(G^ B ) = ejdet(Aj) det(Bj), where Aj and Bj are 
the r x r and (d — r) x (d — r) matrices with entries A l j m = A^ l,m and B l j m = B^'i ,r+l , and 
£7 takes values in { — 1, 1}. Thus 

■y r (A,B) = e/det(A/) det(S/). 

In this sum we single out the Fs which contain d, and those which do not, and for the 
former ones the product det(Aj) det(Bj) does not depend of the vector Yd- For those 
which do not contain d, we develop det(-Bj) along the last column, which involves the 
determinants of the matrices Bj i which are the restrictions of B to the last d — r lines 
except i, and to the column indexed by the complement 1° of /, except d. We thus get 



7r (A,B) = Z + ^2(-l) 



=1 



Z[ = e/det(A / )det(B/ ji ), (5.9) 

l€Jr,d<£I 



where Z and all Z[ and ® r d +l are independent of the vector T^. Since this random vector 
Trf has a density, it follows that the variable ^y r (A,B) also has a density, provided Z[ 7= 
a.s. for at least one value of i. 

At this stage, we observe that Z[ has exactly the same structure as j r (A,B), except 
that the dimension of each Bj^ is (d — r — 1) x (d — r — 1) instead of (d — r) x (d — r), 
and that the last column of the original problem has totally disappeared. We can repeat 
the argument, to obtain that Z[ has a density and is thus a.s. non-vanishing, as soon as 
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some similar quantity (where the last two columns of the original problem no longer show 
up) is a.s. non- vanishing. Then, after an obvious induction, we deduce that r ) r {A,B) has 
a density as soon as det(A/) ^ a.s. for I = {1, • • • , r}. 

However, since the entries of this last Aj are Aj$*- for i,j = l,--- ,r, and all those Aj 
are non zero, it is well known (and also a simple consequence of the previous proof, in 
which we develop det(-Aj) according to its last column and perform the same induction 
procedure) that det(Aj) has a density. This indeed shows us that ^y r (A,B) has a density, 
hence E(7 r (A, B) 2 ) > and the proof of the first part of (|3.5p is complete. □ 

2) The results about T' r (u) and r"(u). When the rank of a is smaller than r, we have 
seen that, with the previous notation, ^ r {A, B) = 0, hence also T' r (u) = T"(u) = 0. 

Next, we turn to the case when the rank of a is r. Exactly as in the previous proof, it 
suffices to show the result when u = (A, J, 7, o). Recalling that F r {u, 1) and F r {u, 2) have 
the same law, we have K((F r (u, 1) — F r (u, 2)) 2 ) = 2(T' r (u) — T"(u)) and thus the second 
part of (j3.5[) holds unless F r (u, 1) = F r (u, 2) a.s. 

With the previous notation, we have F r (u, 1) = j r (A, B) 2 , and also F r (u, 2) = 
^r{A, B) 2 , where A and B are given again by (|5.8j) with the same Aj's, and random 
vectors (<£j, Tj, Qj) having globally the same distribution as ($j, Tj, Qj) (which may of 
course be defined for j > d): these two families of vector are not independent, and we 
have in fact 

¥j = 7f ( ^'" 1 + <&2j) ' Tj = 7! ( T 2;-i + T 2;)> (s-io) 

plus a more complicated relation relating Qj with the Qj' for j' < 2j and the vector a. 
What we need to prove is then F(\j r (A,B)\ / |7 r (A,B)|) > 0. 

We have f)5.9|) . and also, by the same argument, 
lr (A,B) = Z + ^-i)^ ^ 1 ^ 1 ^ +Q r d +l )z' t , Z\= e I det(A I )det(^ I , i ), 

i=i V I<=Jr,d(£I 

where we have also used the second part of (|5.10p . Here, the vector has a density and 
is independent of all other terms showing in the above expression, and also independent of 
j r (A,B). Therefore, \ j r (A,B)\ / \ j r (A,B)\ almost surely on the set {Z\ / 0}. Now, ~Z\ 
is the same as Z[, upon replacing (A,B) by (A,B) everywhere, hence the previous proof 
shows that indeed 7^ a.s. This shows that in fact P(|7 r (^4, B)\ 7^ |7 r (A B)\) = 1, thus 
ending the proof of the second part of (|3.5j> . □ 

5.3 Some stochastic calculus preliminaries. 

We assume (H) and, by localization (see e.g. section 3 in [l]), we may also assume that all 
processes X t , at, b t , at, v t , a' t , v[, a", v" are uniformly bounded in (co,t). The constants 
are always written as K, or K p if we want to stress the dependency on an additional 
parameter p, and never depend on t,i,n,j. For any process Y, we use the following 
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simplifying notation: 

= ?2idA n , = Y2idA n ■ 

For all p,t,s > 0, we have by Bnrkholder-Gnndy inequality 

E( sup \Y t+u -Y t \P\ F t ) < K q s q / 2 MY = X, a, b, • 
ue[Q,s] 



We set 



Vt,. 



sup \\Y t+u -Y t \ 

itS[0,s], Y=a,v',v" 



2idA„ ,2dA n 



jrr, 



(5.11) 
(5.12) 

(5.13) 



Lemma 5.3 For all t > we have A n E( YaI™^ 1 V?) -> 0. 

Proof. It suffices to prove the result separately when Y = a or Y = v' or Y = v" . Set 
7™ = sup sg (Q 4rf A n ] \\Yt+ s — Yt\\ 2 , so K((t]^) 2 ) is smaller than E(7g) when i = and than 
qZA~ $2{i-V)dA w hen i > 1. Hence by the Cauchy-Schwarz inequality, 



( E ^t!( e ( a " E «) 2 )) ' <K t fa(is+ [ ad, 

i=l v<M . =Q Jo 



[t/2dA„]-l 



1/2 



We have 7" < iT, whereas the cadlag property of Y yields that 7™(w) — > for all uj, and 
all s except for countably many strictly positive values (depending on uj). Then, the claim 
follows by the dominated convergence theorem. □ 



The proof of Theorem 13.21 is based on a decomposition of the increments ZV^ 
Z(2id+K{j-i))A • ^ n or der to understand better this decomposition, we first deduce from 
(12. 8|) that, for any z < t < s, and with vector notation, 



(2id+K.j)A„ 

de ' 



b u du = ai + a2 + + a^, 



o~ u dW u = «5 + Q6 + aj + as + ag + «io + «n, 



where 

ai = b z (s - i), a 2 = ft ( /" <4 du^ du a 3 = J/(W„ - W z ) du, 
"4 = // ( ftW w - v' z ) dW w ^j du 

a 5 = a z (W s - W t ), a 6 = a z ft(u- z)dW u , a 7 = ft ( ft{a w - a z ) dw\ dW u 
as = v z f t s (W u - W z ) dW u , a 9 = ft ( ft ( ft < dr) dW^) d^ n 

«io = < ft ( JTO^ " W g ) dW w ) dW u , a n = // ( ft ( ft(rf - v" z ) dW r ) dW w ) dW u . 

A repeated use of the Burkholder-Gundy and Holder inequalities shows that, in view of 
our assumptions on the various coefficients, and for any p > 1: 



E(\ aj \ p \T z )< { 



' K p (s 


- z)p/ 2 


if 


J 


= 5 


K p (s 


-z) p 


if 


j 


= 1,8 


K p (s 


- zfP/ 2 


if 


j 


= 3,6, 10 


K p (s 


-z) 2 p 


if 


J 


= 2,9 


{ K p (s 


- z) 3p / 2 E( V P s _, | T z ) 


if 


J 


= 4,7,11 
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We can then apply the previous decomposition with z — HdA n , t — (2id-\- n(j — l))A n 
and s = (2id + nj)A n , and add the increment of the process X\ to obtain 

(2i d+jK )A n ~ (2i d+ 0-l)*)A n _ n K + y /^ nf + ^ ffi + ^ g»f (5 . 14) 



for k = 1,2, and where (explicitly writing the components) 

ra,K,Z _ 1 n,lm( W m _ W m ) 

m=l 

Pi,j - °i + K A n 2^ v i J(2id+ K (i-l))A n V ,/1/ s ~ W 2idA n ) aW s 
m,k=l 



d ^ 

+ 7^T E 17 m ^(2?d+Kj)A„ ~ W (2i<i+«Cj-l))A„) 



m=l 

' = ^ ( e ta J<ffi#-V (* - 2^A n ) ^ 

x m=l 
r(2id+Kj)A„ 
'(2id+K(j-l))A n 

r(2id+Kj) An 

l(2id+K(j-l))A n \J(2id+K(j-l))A n y ry u ~ yY 2idA n ) UlVV u)"-" s 



H,j («A„) 3 / 2 v 

x m=l 

/n,Zm r(2id+Kj)A„ /"rym _ wm \ J„ 
+ 2^ w j Jted+K^-lMA^^s l/,/ 2idA n J as 



m=l 

+ e 

m,/,fc=l 



and 8™'j is a remainder term, and for p > 1 we have the estimates when j < 2d if k = 1 



and j < d when k = 2 (recalling 7y^ s < if): 

E(ll<fll p + II^T + ll7ST|J7 , )<^p 



(5.15) 

E(||^f ||* | F?) < K p (Ar/ + ( V f) 2A P) < K p . 



We end these preliminaries with a lemma which compares S n,K for k = 1, 2 with the 
following processes: 



[i/2dA n ]-l 

S(r)r = 2dA n E 7rK' K ^r K ) 2 , where 



i=0 

It also compares V n > K > K of iH| with 

[t/2dA n ]-l 

V(r)?' K > K ' = 4d 2 A n 7r(A^ K ,B^ K ) 2 7r (A^ K ',B^ K ') 2 . (5.17) 

i=0 

Lemma 5.4 If r< (oj) < r identically for some r £ {0, • • • , d}, we have for k,k' = 1, 2; 

S n ' K - S{r) n ' K ) ^ (5.18) 



(5.16) 



/a; V( K A n ) d ^ 

and 

. , * . V n > K > 1 -V(r) n ' K ' K ' ^ (5.19) 
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Proof. We denote by £™ A the ith. summand in the definition fl2. 12f) of S™' K . Besides the 
matrices in (j5. 16[) . we also define 

i = m at( 7i i , ■ ■ ■ , 7i,d )» D i = mat (^,i > ■ ■ ■ > S i, d )■ 

We start with (15.181) . Applying (15.51) with /i = ^/KA^, the fact that each ^4™' K has at 
most rank r (because rt < r), and the estimates ()5. 15[) plus the Cauchy-Schwarz inequality, 
we obtain 



= lr{A n ^\ B n ^f + 2^A^C' K + C' K , where 



»n,K ( au.k 1 - ) n,K\f / aTI.k T->n,K\ , // *n,K 7->n,K ,-~<n,K 



E(|C K |) < AA n + Ay / A^E(r/™). 



In view of Lemma E3 Ej^" 1-1 C °- Since ^ = 2dA n ESSS^ 1 " 1 ^ 

it remains to prove that A n Ei=o <iA ™'' 1 C™ A 

For this purpose we use the decomposition £™ A = C 4 ' n ' K + Cf™'^ where (^ n ' K = E(£™' K I 
Tf). By Doob's inequality, (|5.15p and the fact that Cf n is J-T^i-measurable, we have 

[s/2dA„]-l [t/2dA„]-l 

£ C"))<2«E( E | C f)<£. 

— i=0 i=0 

Thus A n Ef=o dAn ' 1 Ci'™'^ 0) an d the result will hold if we can prove that C J ' n ' K = 0. 
We even prove the stronger statement that E((^ n ' K | Q w V J-f) = 0, where Q w is the 
c-field generated by the whole process W , and this is implied by 

I £ Z(r,d-r)i I' e ^(r-l,d-r+l)> I" ^ ^(r,d-r-l,l) =^ 

E(det(G^n,K B n, K ) det(G^„, K)B „, R ) | ^ V Jf) = (5.20) 
E(det(G* det(G*l K c n,«) | G w ' V F\ l ) = 0. 

i ' i i ' i 1 i 

The variables ' Z , and 7™^' / have the form $(w, (W(w) 2 idA„+t - W(w) 2 idA„)t>o), 

with $ a (0^' V Jf)<g> immeasurable function on Q, x C(R + ,R d ), where C(R+,R d ) is the 
set of all continuous Revalued functions on R+ and C d is its Borel <r-field for the local 

n,K,l 1 n,K 
y. '. ' nr $ = 7' ■ 



uniform topology. When $ = a™' K,i or $ = 7™' K,i , the map x 1— )■ <I>(a;,x) is odd, in the 



sense that — x) = $(a;, x), and it is even when $ = ft™'?' . 

In (I5.20p . the three variables det(G\ ), det(G* ), det(G I , 

i > i i ' i i 1 i 1 i 

associated with three functions ^I/', ^t" of the same type. What precedes yields that \l/ 
is even (resp. odd) if r is even (resp. odd), and both ^' and ^t" are even (resp. odd) if r 
is odd (resp. even). Consequently, the products and vpvp" are odd in all cases. Since 
the Q w V ^"-conditional law of (W^uAn+t ~ ^2idA n )t>o is invariant by the map x h-> — x 
on C(R + ,R d ), we deduce (IQOD . hence ([5718D holds. 
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Finally, we turn to (|5.19p . Let 6f be the iih summand in the right side of A3. 13H . for 
K, k' fixed. We can apply (|5.6p with h = y/ nA n and h = \J n'A n , and (|5. 14[) and ()5. 15[) 
again, to get 



E 



{KK'A 2 n ) d - r 



B i ) 



< K\/ A r 



(|5.19p follows, and the proof is complete. 



□ 



5.4 Proof of Theorem I3.2L Corollary I3.3L and Proposition 13.41 

1) Observe that, with the notation (13, 2h and (13. 3|) . and upon taking 

,.n ~ Q ,n h ny TTT W (2id+t)A n ~ W 2i dA n ==, W (2id+t)A n ~ W 2idA n 

Mi = V?i )0-,u» A j) W"t = -== , H/ f = -== ,(5.21) 

we have r ) r (A™' K , B™ ,K ) 2 = F(u?,k). We consider the two-dimensional variables £™ with 
components 

C' K = 2d ^A~ n { lr (Al K , B? K f - r r «)) , k = 1, 2. (5.22) 

Since is ^"-measurable, whereas the processes W and W above are independent of 
, we deduce from (I3.4p . and from (15. 15H for the estimate below, that 

4d 2 A n r;(u«) if K = K' ( 5 - 23 ) 

Ad 2 A n T^(uf) Mk^k' 

□ 



err i-^r) 



2) By (I5.18p . for Theorem l3.2l it is enough to prove the stable convergence U'(r) n ^4U(r), 
where U'(r) n is the two-dimensional process with components U'(r) n,K = } (S(r) n ' K — 

V A n 

S(r)) and the quantity S{r) n ' K is defined in (|5.16p . We have U'(r) n = Y n + Y' n , where 
[t/2dA„]-l 

y t n = e e 

[*/2dA n ]-l 

y *' n = vfc ( 2dA - £ r r ( ff f, 5, <, 6™) - /„' r r (c7 S , 5, 6.) d S 

and £™ is given in (I5.22p . Since the three processes <r, v, 6 are Ito semimartingales, whereas 
T r is a C°° function, it is well known that Y' n =l> (see e.g. section 8 in [I]). We are 
thus left to prove that 

r ^ W(r). (5.24) 

By virtue of the first two parts of (I5.23p . a standard CLT for triangular arrays of martingale 
(see [HJ Theorem IX. 7. 28]) increments shows that, for (I5.24p to hold, it suffices to show 
the next two properties: 

[t/2dA n ]-l 

E E^rer'i-^) nor' (5.25) 

i=0 
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[*/2<fA n ]-l 

H^(N {2{i+1)dAn -N 2idAn )\TT) AO (5.26) 

for all t > and for any bounded martingale N orthogonal to (W, W') and also for 
N = W m or N = W' rn for any m. 

The last part of (|5.23p and the cadlag property of a, v, b, plus the fact that T' r and T" 
are polynomials, immediately gives us (|5.25p by Riemann integration. 

The proof of (|5.26p is also standard: By construction, £™ is a two-dimensional variable 
of the form (W 2i dA n +t(u) - W 2 idA n (w))t>o, (W 2idAn+t (u) - W^ idAn (cj)) t > ) similar 
to the functions occurring in Lemma 15.41 and since in the definition of £f one takes 
squared determinants, all these functions <3? are globally even in the sense that 3>(w, x, y) = 
&(u), — x, — y) for any two d-dimensional functions x,y on M + . So, on the one hand, 
after multiplying the function $ corresponding to by x m (2dA n ) or y m (2dA n ), one 
gets an odd function, and (|5.26|) when N = W m or N = W' m follows. On the other 
hand, by the representation theorem one can write £f as the sum of two integrals over 
(2idA n ,2(i + l)dA n ] with respect to W and W' , for suitable predictable integrands; thus 
when N is orthogonal to W and W', the increment N^ 2 (i+i)dA n — ^2idA n has J^-conditional 
correlation with both those integrals, thus yielding (15.26P again. 

Therefore, the proof of Theorem 13.21 is complete. □ 



3) A simple calculation shows that 



1 i + vs;p(#/5(r)T 



R(n,T)-r=- -log " K n n ' K if S(r) T > 0, 

log 2 *l + ^U(r) n //S(r) T 

hence on the set £l r T . Since the sequence U(r)j, is tight, it follows from a Taylor expansion 
that 

1 — - /--> \nA tti \n,2\ 



on £l r T again. Then Corollary 13.31 follows from Theorem 13. 2| upon observing that the 
^-conditional variance of U(r)j, — U{r)\ is the numerator of the right side if (I3.12p . □ 

4) Now we turn to the proof of (I3.14p . and by (I5.19|) it suffices to prove the convergence 
of V(r) n ' K,K . We suppose that k = k' , the proof in the case k ^ k' being analogous. We 
set 

iff = ir(Al\ B^)\ nf = E( V ? I J?), r?r = rfi- ? f • 
As for ([5723]) . we deduce from ([33]) and (15T51) that 

C = 2d{r' r (uf) - rv(^) 2 ), E(\ v 'r\ 2 ) < K. 

On the one hand, the same argument as for proving (|5,25p shows that Ad 2 A n ^1=0^ 1 
converges in the u.c.p. sense to the right side of (|3.14j) (for k = k'). On the other hand, 
since rfl n is a martingale increment relative to the filtration (J 7 ")j>o, we deduce from 
Doob's inequality that Ad 2 A n Y^S^' 1 v" n ^ 0. We then deduce Q53ty . □ 
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5) Finally, for (I3,15p it is enough to show the convergence in probability in restriction to 
each set Qj,, for r = 0, • • • , d. For this we use the following convergence properties, which 
readily follow from (|3.8p . (13. lip , in restriction to the set W T : 

_L_ 5 M A S{t) t > 0, R(n, T) A r, 

together with ()3.14j) applied at time T, which also holds on Q r T . Then f|3. 15[) follows after 
a (slightly tedious) calculation, in view of the form (|3.12p of V(T) on £l r T : the proof of 
Proposition 13.41 is complete. □ 



5.5 Proof of Theorems 14.11 and 14.51 



We begin with a lemma. Its setting apparently extends the setting of the theorem to be 
proved, but this will be useful for the proof itself. The extension concerns the fact that we 
replace the non-random terminal time T by a stopping time, still denoted by T, which is 
positive and bounded. In this case, the notation (|2.3p still makes sense, as well as A(p)^ 
and a(n,T) and V^ KK ' , as given by (j4T23j) and (OTP . 

Lemma 5.5 Assume (H) and rt = r for all t < T with T a positive finite stopping time 
and r E {0, • • • , d}. Then for all p > 0, k, k' E {1, 2} and Q r s ' KK as in (|3.14j) we have 

A{pfr A Tr p (5.28) 
yn, K K' 2d [ T ( ^ ^ i- - 2 e r s > KK ' ds. (5.29) 



(««'A2)<*-r T J Q \r r (a s ,a,v s ,b s ) S(r) T , 

Moreover, if r > 1, the following stable convergence in law holds, where U{r) is defined in 



(5.30) 



Theorem \3. 



{u{r)^^{A{p)^-a{n,T)rP)) 



Proof. 1) Let -y t = r r (at, ct, u t) 6^), which is a continuous process, positive on [0,T] by 
Lemma [3.11 Thus T m = mATAinf(t : jt < 1/m) satisfies P(T m = T) — > 1 as m — > oo and, 
if any one of the claimed convergence holds for each T m (instead of T), it also holds for T. 
In other words, we can assume T < A and 1/jt — A for some constant A and all t E [0, T]. 
Moreover, T r is a polynomial, so the process jt is a continuous ltd semimartingale, and 
by localization again one can assume that for some other constant A' , 

E(| 7t+s - 7t | 2 ) <A's. (5.31) 

The sequence U (r) n converges in law toward a continuous process, so the moduli of continu- 
ity p(n,x) = sup(||£/(r)" +s — C7(r)"|| : t < A', \s\ < x) satisfy lim-40 limsup n F(p(n,x) > 
1) = 0, and thus with the simplifying notation w n = 2dk n A n we have 

P(fi„) -> 1, where Q n = {\\U{r)? +s - U(r)?\\ <lVt<A', s< w n ). (5.32) 
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2) Observe that 



-t^o log ff+^L , where 



log 2 ^Q + V^V- 

Qi = S(r) 2idAn+Wn - S{r) 2ldAn , rji' k = U(r) r ^ dAn+Wn - U{r) 2idAn . 

Recalling l/ 7t < A, and since Q = w n (j 2 idA n + pf), where E(|/3™| 2 ) < A"w n by (Eg) , 
one has 



1 



CP 72tdA„ 



(5.33) 



Moreover, take a E (0,1/2) such that 2| lo f (1 2 a)l < 1/2. For n large enough we have 

Ay/An jw n < a because /c n A n — > oo. In this case, in restriction to the set O n , for all 
i < [T/2dA n ] — k n — 1 we have with a constant K (varying from place to place below): 



— n 

nVi 



r > 1 



- w n - 2' 



^-j n,l ri.1 



log 2 C? 



< K 



R"- 



1 



< 



K^/A^ a 1 



(5.34) 



3) Recalling (|4.23p and T < A, when r = the last estimate above yields 

A p/2 
Wn 

which goes to because k n y/ A n — > oo. Thus in view of (I5.32|) one gets (15.28P when r = 0. 

4) At this stage, we start proving (|5.30p . and thus assume r > 1. We observe that 
1 



[T/w n }-2 



(A(pK-a(n,T)rt>)=w T 



C, wheretf = y =(\B? k jA(d+iy-r*>). 

i=0 V n 



(|5.34p implies that for n large enough, < d+1 (recall r < d), hence a Taylor expansion 
of the function x \— > \r + x\ p — r p imply, again for n large enough: 



„ Pr^ 1 V ikn ~ %g 



n,l n,2 



^A 



< K — on J7 n and for i < [T/w n ] — 2. 



log 2 S„ 

Upon using (I5.33p . and by the Cauchy-Schwarz inequality, it follows that 



E 



n,l n,2 



log 2 U7 n 7 (i _ 1)t 



w 



hence 



E(\Y n -Y;\l Qn ) -+0, where ^ 



„_1 [?>„]-2 n,l n,2 

, _ prP ^ ?? ijfcn - r? ifcn 



log 2 



E 

i=0 



7(i-i)i 
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because k n An^ — > oo. Recall also (|5.27p . Then, by virtue of (|5.32p . the convergence (|5.30p 
will follow from 

(U(r)-X) (U(r)T,y), y = ^ — (dU(r)\-dU(rf s ). (5.35) 

log 2 J T r (a s ,a,v s ,b s ) 



5) By Theorem VI. 6. 15 of [8] it follows from (15.23P and (|5.25p that not only does the 
sequence of processes U{r) n converge in law, but it also enjoys the so-called P-UT prop- 
erty (predictable uniform tightness). By a trivial extension of Theorem VI. 6. 22 in [8], 
this implies that if a sequence H n of adapted cadlag two-dimensional processes on O 

is such that the pair (U{r) n ,H n ) ^4 (U(r),H) (functional convergence for the Sko- 
rokhod topology), the bi-dimensional processes (U \r) n , J* * Hf_ dU (r)™) converge stably 
in law to (W(r), jt H s - dll(r) s \ and since U{r) is continuous and T is .F-measurable, 
this in turn implies the stable convergence of the variables (l/(r)£,/ Hf_dU(r)™) ±4 
(U(r)T, Jq H s - dU(r)s). At this point, (|5.35|) follows, by taking the processes H n and H 
with components 

h; " 1 = ~ ar = ift 6 w-'K.^ah, m = -h? = 5^ 

C— s 

(Note that the joint stable convergence (U(r) n ,H n ) ^4 (K(r),H) holds because 1/jt is 
continuous.) This ends the proof of (I5.30p . 

6) Since (|5.30p implies (|5.28p when r > 1, we are left to prove (|5.29p . We fix k,k'. Our 
first observation is that, since U(r)1^ — > M{t)t follows from (13. 8p as seen before, the proof 
of (]3.1ip carries over to the case T is a stopping time. Therefore (R(n, T) — r) log A n — > 
because here Q^, = f2, and thus A^™'^ T 1. It follows that ()5.29p amounts to proving 
the same result for the variable V^' KK which is the same as V^ KK except that in front of 
the sum we substitute A^ +M_2i?(n ' T) with A l n +2d ~ 2r . 

With Of being as for Proposition 13.4^ the ith summand in the right side of (|3.13p . we 

have 

2 , Ai _ r xj [T/2dA n ]-k n -l 

Vt'™ = £ \ %tr-] B{j)h where B(j)$ = 4d 2 A n £ v(j)fOf 

3=0 V bT J 2 *=o (5.36) 



Combining (I5.33P and (I5.34p . we obtain for i < [T/2dA n ] — k n — 1 and all n large enough: 



E 



2idA r 



2 



2 



l2idA r 



Since by localization we may assume that the processes at,vt, bt are bounded, we may also 
assume Of < K, and upon using ()5.32|) once more, we then deduce that B(j)j, as the same 
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asymptotic behavior as B'{j) T which is given by the same formula, with v{j)™ substituted 
with the following variables v'(j)f: 

v'(0)?= 1 = _ 2 -J— , t/(2)? = l. 

(72idA n J 72idA„ 

This allows us to get, with T(0) = 4*, T(l) = ± and T(2) = 1: 

(^)d-r 5 Wt 2d jf T(i),er K ' ^ (5.37) 

(indeed, the case j = 2 is (I3,14h . and the other two cases follow from a standard argument, 
similar to Step 5 above, but simpler because the integrand is a cadlag bounded process not 
depending on n, and 0? > 0). Using further TA^~ r /Slp -A T/S{r) T and recalling (P6|) . 
and upon expanding the square in the right side of (|5.29|) . we obtain this convergence and 
the lemma is proved. □ 



— p 

Proof of (a) of Theorem 14.11 Since R(n,T) — > Rt by (|3.1ip . it suffices to prove that 

A{p)» A A(p) T := f T (r s yds. (5.38) 
J o 

The assumption implies the existence of a sequence of stopping times r,- increasing to 
infinity, such that tq = and Tj < Tj+i if Tj < oo, and such that the process rt takes a 
constant (random) value p(j) on the time interval Jj = (tj_i,Tj), with p(j) / p(j + 1) if 
< Tj < oo. In view of the discussion preceding (I2.3h . the values r Tj . is necessarily smaller 
than or equal to p(j) A p(j + 1), but is irrelevant to our discussion. We also denote by Nt 
the biggest j such that tj < T. 

With an empty sum being set to 0, we have 

Nt [(Tj AT) /w n ]-2 

A(p)^ = J^Y(j) n + Z n , Y(j) n = w n £ I^T' 

J=l i=[Tj-i/w n ]+l 

and where Z n is the sum of at most 3Nt terms of the form w n {\R™\ p A (d+ l) p ). Since Nt 
is finite and w n — > 0, we have — > (pointwise), and it suffices to show that for each 
j > 1 we have 

A ^(3) == ((TAtj) - (TA Tj -i))p(jy. (5.39) 

We then fix j. The variable Y(j) n is the process A(p) n evaluated at time Tj = TArj — ta—\ 
relative to the underlying process X(j) t = X Tj _ 1+t , up to at most two border terms. We 
thus might be tempted to apply f|5.28|) right away, and indeed X(j) satisfies (H) for the 
filtration J~(j)t = J~r j ^ 1 +t, relative to which Tj is a positive bounded stopping time. There 
are, however, a few problems to overcome: 

1. The rank rt(X(j)) associated with X(j) is equal to p(j) for all t 6 (0,7}), but not 
necessarily for t = 0, nor for t = Tj] 
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2. This rank p{j) is random, albeit J-"(j)o-nieasurable; 



To solve these problems we fix e > and consider the process X(j,e) t = X Tj _ 1+e+t , 
satisfying (H) for the filtration J r (j,e)t = F Tj _ 1+£+ t-, and the e)j-stopping time 
T(j,e) = T A tj — T A 7j_i — 2e. The associated rank is thus p{j) for all t € [0,T(j, e)], 
and we will show that if A(p,j, e) is associated with e) by ()4.23p . we have 

A (pJ^)tu,s) T(j,e)pUT = (TAT 1 -TAr,. 1 -2e)pUr. (5.40) 

Indeed, it suffices to prove this in restriction to each set fl' r = {p{j) = r} satisfying 
P(r^,) > 0. If P r denotes the (usual) conditional probability P(- | Q' r ), the process X(j, e), 
on the space (Q' r , F C\ W r , (F{j,e)t H fi' r ),P r ), still satisfies (H) and the associated rank 
is now r on the time interval [0, T(j, e)]. Then Lemma 15.51 yields the convergence (|5.40p 
under P r , hence under P in restriction to each Q' r , hence under Ponfl itself. 

Finally, the difference Y(j) n — A(p,j,e)^,^ £ ^ is a sum of at most 2[e/w n ] terms, each 
one smaller than w n (d + so this difference is smaller than Ke, as is the difference 
between the two right sides of ()5.39|) and (|5.40p . Hence (|5.39|) follows from (|5.40p . by 
taking first n — > oo and then s — > 0. This completes the proof. □ 



Proof of (b) of Theorem 14.11 Exactly as in the previous proof, it is enough to prove 
the result when rt = r > 1 identically, for some non-random r G {1, • • • , d}. By a standard 
localization procedure we can assume that T r (at, <?, vt, bt), which is positive everywhere, is 
bounded from below by a constant 1/A with A > 0, so the assumptions of Lemma 15.51 are 
satisfied. Therefore, (I5.27P and (I5.30P yield that, with Y n and y as in the proof of Lemma 
15.51 and with Z n = -7= (R(n,T) — r) and Z = log2 (^(Ot — ^( r )r)i we have 

(Y n ,Z n ) ^ (y,Z). (5.41) 



Then we obtain 



B(n,p,T)=Y n + 



a(n, T) 
v 7 ^ 



Ir + y/A^ZJ*). 



On the one hand, a(n, T) — > T. On the other hand, since Z n converges in law and 
r > 1, we have by the mean value theorem 



'An 



+ y/A^Z n \ p ) +pr p ~ 1 Z n ^0. 



Hence (|5.4ip yields 



'A 



B(n,p,T) ^4 B(p,T) = y -Tpr p ~ l Z. 



The pair (y, Z) being ^-conditionally centered Gaussian, the same is true of B(p, T), and 
the form (|4.26p of its conditional variance is easily checked, by virtue of (|3.9p . □ 
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Proof of Theorem 14. 5L It is easy to construct a process X' which satisfies the assump- 
tions of (a) of Theorem 14,11 and such that X[ = Xt for all t < T on the set Oy. Then 
on this set B(n,p,T) is the same, when constructed upon X or upon X' , and thus it 
converges in probability on this set to a strictly negative variable. On the other hand, 
Z(n,p,T) is B(n,p,T) divided by a quantity which by construction is smaller than A^/ 4 . 
Then the convergence Z(n,p,T) — > — oo on is clear. 

It suffices to prove (|4.28p on the set n £1^ for any r G {0, • • • , d} such that P(f2^) > 
0. For this we can argue under the conditional probability P r = P(- | {ro = r}), or 
equivalently suppose that we have in fact ro = r. As above, one can construct a process 
X' which satisfies the assumptions of Lemma [5.51 for some stopping time T' which satisfies 
T' > T on the set and we can apply (|5.29p to X' and the stopping time T' A T. This 
gives us (|5.29p for X, in restriction to the set Q^. 

At this point, (|4.28p follows from (|5.29j) by exactly the same calculations as ()3.15p 
follows from (|3.14|) . 

Finally, since V(n,p, T) V(p, T) on fiip, we have Z(n,p, T) = B(n,p, T)/ \J A n BV(n, p, T) 
on a set 0" whose probability goes to 1. The first part of (I4.30P than follows from (14.251) 
and (|4.28p by delta method for stable convergence. □ 



5.6 Proof of Corollary 13.61 

The same stopping argument as in Step 2 of the previous proof allows us to show that, 
without assumptions on the rank process rt, the stable convergence in law (13. 8|) holds in 
restriction to the set fiy, as soon as we restrict our attention to the time interval [0, T]. 

At this stage, the claim of Corollary 13.61 follows from ()3.11j) . an application of the delta 
method, (|3.15p and classical properties of stable convergence in law. □ 



5.7 Proof of Propositions [321 D and l4~6l 

(I3.18P is an obvious consequence of the stable convergence A3. 16f> . For the alternative- 
consistency, it suffices to prove that for any r' / r we have 

P(C(a)"' =r n n£) -»• P(^'). (5.42) 

On the set Qj, we have S(n,T) 2 d ~ r ' , and by ([3.15P the variables V(n,T) converge 
in probability to a limit which is [0, oo)-valued (actually, it is a.s. positive, but we do not 

use this fact here), so that A n V(n, T) — > 0. Since r ^ r, (|5.42p readily follows from the 
definition of C(a)^'~ r . 

Propositions 13.81 and 14.61 are proved analogously, the alternative-consistency in the 
latter case following from the second part of (|4.30p . □ 
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